22 research outputs found

    An efficiency curve for evaluating imbalanced classifiers considering intrinsic data characteristics: Experimental analysis

    Get PDF
    Balancing the accuracy rates of the majority and minority classes is challenging in imbalanced classification. Furthermore, data characteristics have a significant impact on the performance of imbalanced classifiers, which are generally neglected by existing evaluation methods. The objective of this study is to introduce a new criterion to comprehensively evaluate imbalanced classifiers. Specifically, we introduce an efficiency curve that is established using data envelopment analysis without explicit inputs (DEA-WEI), to determine the trade-off between the benefits of improved minority class accuracy and the cost of reduced majority class accuracy. In sequence, we analyze the impact of the imbalanced ratio and typical imbalanced data characteristics on the efficiency of the classifiers. Empirical analyses using 68 imbalanced data reveal that traditional classifiers such as C4.5 and the k-nearest neighbor are more effective on disjunct data, whereas ensemble and undersampling techniques are more effective for overlapping and noisy data. The efficiency of cost-sensitive classifiers decreases dramatically when the imbalanced ratio increases. Finally, we investigate the reasons for the different efficiencies of classifiers on imbalanced data and recommend steps to select appropriate classifiers for imbalanced data based on data characteristics.National Natural Science Foundation of China (NSFC) 71874023 71725001 71771037 7197104

    How electronic word of mouth dynamically influences product sales and supplies: an evidence from China film industry

    Get PDF
    As an important part of e-commerce, online reviews play a significant role in consumers’ purchase decisions. This study investigated the dynamic effects of electronic word of mouth (eWOM) and the number of people wishing to watch a movie on movie sales and supplies in the Chinese movie market. Using a dynamic simultaneous equation system and data of 76 films, this study analyzed the interrelationships between eWOM and movie sales and supplies. Our findings showed that both the volume and valence of eWOM affected movie sales and supplies significantly. The number of people who wanted to watch a movie had an opposite effect on movie sales and supply; eWOM volume had a positive impact on movie sales and supplies; and eWOM valence had a negative impact on movie sales and a positive impact on movie supplies. The number of people who wish to watch a movie was another important variable for movie sales and supplies, and it had a negative impact on the daily movie sales but a positive impact on the daily movie supplies. This study provided a detailed explanation of these results and thus contributed to improving the efficiency of movie suppliers’ utilization of online reviews

    A Similarity Measure-based Optimization Model for Group Decision Making with Multiplicative and Fuzzy Preference Relations

    Get PDF
    Group decision making (GDM) problem based on different preference relations aims to obtain a collective opinion based on various preference structures provided by a group of decision makers (DMs) or experts, those who have varying backgrounds and interests in real world. The decision process in proposed question includes three steps: integrating varying preference structures, reaching consensus opinion, selecting the best alternative. Two major approaches: preference transformation and optimization methods have been developed to deal with the issue in first step. However, the transformation processes causes information lose and existing optimization methods are so computationally complex that it is not easy to be used by management practice. This study proposes a new consistency-based method to integrate multiplicative and fuzzy preference relations, which is based on a cosine similarity measure to derive a collective priority vector. The basic idea is that a collective priority vector should be as similar per column as possible to a pairwise comparative matrix (PCM) in order to assure the group preference has highest consistency for each decision makers. The model is computationally simple, because it can be solved using a Lagrangian approach and obtain a collective priority vector following four simple steps. The proposed method can further used to derive priority vector of fuzzy AHP. Using three illustrative examples, the effectiveness and simpleness of the proposed model is demonstrated by comparison with other methods. The results show that the proposed model achieves the largest cosine values in all three examples, indicating the solution is the nearest theoretical perfectly consistent opinion for each decision makers

    Machine learning methods for systemic risk analysis in financial sectors.

    Get PDF
    Financial systemic risk is an important issue in economics and financial systems. Trying to detect and respond to systemic risk with growing amounts of data produced in financial markets and systems, a lot of researchers have increasingly employed machine learning methods. Machine learning methods study the mechanisms of outbreak and contagion of systemic risk in the financial network and improve the current regulation of the financial market and industry. In this paper, we survey existing researches and methodologies on assessment and measurement of financial systemic risk combined with machine learning technologies, including big data analysis, network analysis and sentiment analysis, etc. In addition, we identify future challenges, and suggest further research topics. The main purpose of this paper is to introduce current researches on financial systemic risk with machine learning methods and to propose directions for future work.This research has been partially supported by grants from the National Natural Science Foundation of China (#U1811462, #71874023, #71771037, #71725001, and #71433001)

    Behavior monitoring methods for trade-based money laundering integrating macro and micro prudential regulation: a case from China

    Get PDF
    Trade-based Money Laundering, a new form of money laundering using international trade as a signboard, always appears along with speculative capital movement which has been accepted as the most concerned and consensus incentive giving rise to the collapse of the financial market. Unfortunately, preventing money laundering is very difficult since money laundering always has a plausible trade characterization. To reach this goal, supervision for regulator and financial institutions aims to effectively monitor micro entities’ behavior in financial markets. The main purpose of this paper is to establish a monitoring method including accurate recognition and classified supervision for Trade-based Money Laundering by means of knowledge-driven multi-class classification algorithms associated with macro and micro prudential regulation, such that the model can forecast the predicted class from the concerned management areas. Based on empirical data from China, we demonstrate the application and explain how the monitor method can help to improve management efficiency in the financial market. First published online 8 May 201

    Machine learning methods for systemic risk analysis in financial sectors

    Get PDF
    Financial systemic risk is an important issue in economics and financial systems. Trying to detect and respond to systemic risk with growing amounts of data produced in financial markets and systems, a lot of researchers have increasingly employed machine learning methods. Machine learning methods study the mechanisms of outbreak and contagion of systemic risk in the financial network and improve the current regulation of the financial market and industry. In this paper, we survey existing researches and methodologies on assessment and measurement of financial systemic risk combined with machine learning technologies, including big data analysis, network analysis and sentiment analysis, etc. In addition, we identify future challenges, and suggest further research topics. The main purpose of this paper is to introduce current researches on financial systemic risk with machine learning methods and to propose directions for future work. First published online 6 May 201

    An efficient consensus reaching framework for large-scale social network group decision making and its application in urban resettlement

    Get PDF
    The authors thank the editor and the anonymous referees for their valuable comments and insightful recommendations. This work was supported in part by grants from the National Natural Science Foundation of China (#71874023, #71771037, #71971042, #71910107002, and #71725001) and supported by the Spanish State Research Agency under Project PID2019-103880RB-I00/AEI/10.13039/501100011033.Urban resettlement projects involve a large number of stakeholders and impose tremendous cost. Developing resettlement plans and reaching an agreement amongst stakeholders about resettlement plans at a reasonable cost are some of the key issues in urban resettlement. From this perspective, urban resettlement is a typical large-scale group decision-making (GDM) problem, which is challenging because of the scale of participants and the requirement of high consensus levels. Observing that residents who are affected by a resettlement project often have tight social connections, this study proposes a framework to improve the consensus reaching and uses the minimum consensus cost to reduce the total cost for urban resettlement projects with more than 1000 participants. Firstly, we construct a network topology that consists of two layers to deal with incomplete social relationships amongst large-scale participants. An inner layer consists of participants whose preference similarities and trust relations are known. Meanwhile, an outside layer includes participants whose trust relations cannot be determined. Secondly, we develop a classification method to classify participants into small subgroups based on their preference similarities. We can then connect the participants whose trust relations are unknown (the outside layer) with the ones in the inner layer using the classification results. To facilitate effective consensus reaching in large-scale social network GDM, we develop a threestep approach to reconcile conflicting preferences and accelerate the consensus process at the minimum cost. A real-life urban resettlement example is used to validate the proposed approach. Results show that the proposed approach can reduce the total consensus cost compared with the other two practices used in the actual urban resettlement operations.National Natural Science Foundation of China (NSFC) 71874023 71771037 71971042 71910107002 71725001Spanish State Research Agency PID2019-103880RB-I00/AEI/10.13039/50110001103

    Large-scale group decision-making with non-cooperative behaviors and heterogeneous preferences: An application in financial inclusion

    Get PDF
    The authors thank the editor and anonymous referees for their valuable comments and insightful recommendations, and thank Prof. Yucheng Dong for the helpful suggestions. This research was supported in part by grants from the National Natural Science Foundation of China (#71874023, #U1811462, #71725001, #71771037, #71971042, #71910107002) and Major project of the National Social Science Foundation of China (#19ZDA092). Enrique Herrera-Viedma is supported by the FEDER funds in the project TIN2016-75850-R.Non-cooperative behavior is a common situation in large-scale group decision-making (LSGDM) problems. In addition, decision makers in LSGDM often use different preference formats to express their opinions, due to their educational backgrounds, knowledge, and experiences. Heterogeneous preference information and non-cooperative behaviors bring challenges to LSGDM. This study develops a consensus reaching model to address heterogeneous LSGDM with non-cooperative behaviors and discuss its application in financial inclusion. Specifically, the cosine similarity degree is introduced to build a distance measure for different preference structures. Clustering analysis is employed to divide large-scale groups and handle non-cooperative behaviors in LSGDM. A consensus degree and a weighting process are proposed to decrease the influence of non-cooperative behaviors and facilitate the consensus reaching process. The convergence of the proposed approach is proven by theoretical and simulation analyses. Experimental studies are carried out to compare the performances of the proposed approach with existing methods. Finally, a real-life example from the “targeted poverty reduction project” in China is presented to validate the proposed approach. The selection of beneficiaries in finance inclusion is difficult due to the lack of credit history, the large number of participants, and the conflicting views of participants. The results showed that the proposed consensus model can integrate opinions of participants using diverse preference formats and reach an agreement efficiently.National Natural Science Foundation of China (NSFC) 71874023 U1811462 71725001 71771037 71971042 71910107002Major project of the National Social Science Foundation of China 19ZDA092European Union (EU) TIN2016-75850-

    A geometrical method for consensus building in GDM with incomplete heterogeneous preference information

    Get PDF
    This work was supported in part by grants from the National Natural Science Foundation of China (#71874023, #71725001, #71910107002, #71771037, #71971042), and European Regional Development Fund (FEDER), European Union funds in the project TIN2016-75850-R.In real-life group decision-making (GDM) problems, the preferences given by decision-makers(DMs) are often incomplete, because the complexity of decision-making problems and the limitation of knowledge of DM make it difficult for DMs to take a determined evaluation of alternatives. In addition, preference relations provided by DMs are often heterogeneous because they always have different decision habits and hobbies. However, the consensus method for GDM under incomplete heterogeneous preference relations is rarely studied. For four common preference relations: utility values, preference orderings, and (incomplete) multiplicative preference relations and (incomplete) fuzzy preference relations, this paper proposes a geometrical method for consensus building in GDM. Specifically, we integrate incomplete heterogeneous preference structures using a similarity-based optimization model and set a corresponding geometrical consensus measurement. Then, preference modification and weighting processes are proposed to improve consensus degree. Finally, we conduct a comparison analysis based on a qualitative analysis and algorithm complexity analysis of existing consensus reaching methods. Numerical analyses and convergence tests show that our method can promote the improvement of the consensus degree in GDM, and has less time complexity than the previous methods. The proposed geometrical method is a more explainable model due to operability and simplicity.National Natural Science Foundation of China (NSFC) 71874023 71725001 71910107002 71771037 71971042European Regional Development Fund (FEDER), European Union TIN2016-75850-
    corecore